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Abstract —We consider the uplink of a cellular massive MIMO 
network. Since the spectral efficiency of these networks is limited 
by pilot contamination, the pilot allocation across cells is of 
paramount importance. However, finding efficient pilot reuse 
patterns is non-trivial especially in practical asymmetric base 
station deployments. In this paper, we approach this problem 
using coalitional game theory. Each cell has its own unique pilots 
and can form coalitions with other cells to gain access to more 
pilots. We develop a low-complexity distributed algorithm and 
prove convergence to an individually stable coalition structure. 
Simulations reveal fast algorithmic convergence and substantial 
performance gains over one-cell coalitions and full pilot reuse. 

I. Introduction 

Massive MIMO (multiple-input, multiple-output) technol¬ 
ogy has over the last few years emerged from a theoretical 
concept |[Tj to a key solution for future wireless networks [2j. 
This is because it can improve the sum spectral efficiency 
(bit/s/Hz/cell) of cellular networks by orders of magnitude |3|, 
without the need for more spectrum or more base stations 
(BSs). In massive MIMO, each BS is equipped with an array 
of hundreds of active antennas, which are processed coherently 
to improve the signal quality in the uplink and downlink f4i|. 

Massive MIMO systems require channel state information 
(CSI) at the BSs, for example, to separate uplink signals sent 
in parallel by different user equipments (UEs). CSI is acquired 
from uplink pilot signaling. The pilot sequences are precious 
resources in cellular networks since accurate CSI estimation 
requires low interference during pilot transmission (i.e., low 
so-called pilot contamination (5|). Contemporary networks 
have an over-provision of pilots—many more orthogonal pilots 
than active UEs per cell—thus pilot contamination is essen¬ 
tially alleviated by allocating the pilots at random. In contrast, 
massive MIMO networks attempt to schedule as many users 
as possible to achieve a high sum spectral efficiency (SE) ©. 

An efficient and robust way to mitigate pilot contamination 
is fractional pilot reuse, where only a subset of the pilot 
resources are utilized in each cell ©, ©, 0- This is con¬ 
ceptually simple in symmetric networks (e.g., one-dimensional 
cases as in 0 or two-dimensional cases with hexagonal cells 
as in |3|, (6J); one can color the cells in a symmetric pattern 
and divide the pilots so that only cells with the same color use 
the same subset. The pilot clustering in practical asymmetric 
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deployments, where every cell has a unique shape, is non¬ 
trivial and must be optimized for each particular deployment. 
The purpose of this paper is to use game theory to develop an 
algorithm for adaptive pilot clustering, which can be applied 
in cellular networks with arbitrary asymmetric cell geometries. 

Coalitional game models offer structured mechanisms to 
find cooperation between decision makers (players). A charac¬ 
teristic of these mechanisms is their natural implementability 
in a distributed way. With such merits, coalitional game theory 
has found many applications in communication networks; 
see the tutorial 18 ]. A class of coalitional games, called the 
partition form (9|, takes into account the dependencies of the 
players’ utility functions on the overall partition of players into 
distinct coalitions, called coalition structure. The application of 
such game models to determine which BSs (players) cooperate 
with each other is appropriate in this paper as the performance 
of each cell depends on the whole coalition structure. 

The solution of a coalitional game in partition form is a 
coalition structure with specified stability requirements. The 
stability requirements are generally related to the feasible 
transitions from one coalition structure to another. One model 
of stability, called individual stability, restricts a single player 
to move from one coalition to another (TO). Such models, 
which we utilize in this paper, have been applied for channel 
sensing and access in cognitive radio © and for coalition 
formation in the MIMO interference channel [12|. 

In this paper, we assume that each BS has a set of unique 
pilot sequences. We propose a distributed mechanism to find 
the sets of BSs that cooperate to gain access to each others’ 
pilots and thereby serve more UEs. A set of BSs have the 
incentive to cooperate whenever the larger number of UEs that 
can be scheduled in their cells leads to an improvement in their 
individual per-cell average SE. Our mechanism is a coalition 
formation algorithm which converges to an individually stable 
coalition structure under what we define as searching budget 
constraints. In doing so, the complexity of the algorithm is 
controlled. Simulation results reveal fast convergence of the 
algorithm giving considerable performance gains over one-cell 
coalitions and full pilot reuse. 

II. System Model & Sum Spectral Efficiency 

We consider the uplink of a cellular massive MIMO net¬ 
work with L cells, each associated with an index in the set 
C = {1,..., L}. BS j is equipped with an array of M antennas 
and serves K :j UEs. The transmission is divided into frames 
of T c seconds and IT). Hz, such that channel between each UE 
and BS has a constant channel response within a frame, but is 
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Fig. 1. Illustration of a massive MIMO network with L cells and wrap¬ 
around. Each cell contains a BS with M antennas and the colors indicate 
clusters of cells that use the same pilots. 

different between frames. Consequently, each frame contains 
S = T C W C transmission symbols. In each uplink frame, B 
symbols are allocated for pilot signaling and the remaining 
S — B symbols are used for uplink data. The B pilot symbols 
permit B orthogonal pilot sequences; that is, only B UEs 
in the entire network can transmit pilots without interfering 
with each other. In this paper, we study how the L cells can 
share these pilots to maximize the average sum SE. Since pilot 
contamination is mainly a problem in highly loaded networks, 
we assume that there is at least B potential UEs per cell—it 
is up to each BS to decide how many of them that are active. 

Each cell is given a fraction ^ of unique pilot sequences, 
where p for convenience is assumed to be an integer. BS j 
can keep its l / pilots by itself and serve Kj = ® UEs without 
any pilot contamination. Alternatively, it can form a coalition 
with other cells to share the access to each others’ pilots. 

Definition 1 (Coalition structure). A coalition structure C is 
a partition of C, the grand coalition, into a set of disjoint 
coalitions {<Si,..., <Sjv} with U n=1 S n = C and = 0- 

For example, let <f>j(C) denote the coalition in C that BS j 
belongs to. The coalition members have access to p|$j(C)| 
pilots, where |-| denotes the cardinality of a set. Then, BS j can 
serve Kj = j- | <I> ? (C ) UEs, but the drawback is that the cells 
in the coalition contaminate each others’ pilot transmissions. 

Fig. □ gives an example of a cellular network with L = 16 
cells in an area with vertical and horizontal wrap-around. The 
cells have formed four coalitions: green, yellow, red, and blue. 
Since each coalition has four members, each BS has access to 
p pilots and serves -y UEs in each frame. Pilot contamination 
is only caused between cells with the same color. 

A. Average Uplink Spectral Efficiency 

The coalition formation is based on maximizing the SE in 
each cell. The vast majority of SE expressions for massive 
MIMO assumes that each BS serves the same number of UEs 
(cf. 0, 0, (13)). In contrast, the BSs may form coalitions of 
different sizes and thus serve unequal numbers of UEs herein. 
We now generalize the SE expressions from (3) to handle this. 

The UEs are randomly distributed in the serving cell. In a 
certain frame, suppose that zj k £ R 2 is the position of the fcth 


UE in cell l. The channel response lip. £ C M 
UE and BS j is modeled as Rayleigh fading: 

hjjfc ~ Cff(o, dj(zik)lM^, (1) 

where I m is the M x M identity matrix. The function 
d 3 (z) gives the variance of the channel attenuation from a 
UE position z to BS j. The value of dj(z[ k ) is known at 
BS j. The UEs apply power control with the purpose of 
achieving uniform performance and avoid near-far blockage. 
More precisely, the UE at location Zj k uses a transmit power 
of p/dj(zjk) per symbol, where p is a design parameter. The 
resulting signal-to-noise ratio (SNR) at any antenna at the 
serving BS is p/a 2 , where tr 2 is the noise variance. 

Orthogonal pilots are used within each cell, while the 
fcth UE of each cell in a coalition uses the same pilot. 
By correlating the received signals with corresponding pilot 
sequences [13 , the received pilot signal y j k ot £ C M at BS j 
for its UE k is 
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where 7jj k ~ CAf(0, ct 2 Im) is additive noise. Notice that there 
is only interference from one UE per cell in the coalition 
<1 >j (C), and no interference from other coalitions. Based on 
y^ lot , BS j computes the MMSE estimate of h jj k [3]: 
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During uplink data transmission, all BSs are active and the 
received signal y 7 £ C M at BS j is 
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where xik £ C is the symbol transmitted by UE k in cell 
l. This signal is normalized as E{|a,’ifc| 2 } = 1, while the 
corresponding UL transmit power is dl ^ lk y , as defined earlier. 
The additive noise is modeled as n, ~ CA r (0. ct 2 I ; \/). 

Linear receive combining is used in massive MIMO to 
separate each UE’s signal from the interfering signals. BS j 
selects a combining vector g j k £ C M for each of its Kj UEs 
and multiply it with the received signals in (4), as g“ fc yj ata , to 
obtain an effective scalar signal. Let H ? = [hpi ... h jjK ] £ 
C M be the estimated channels in cell j. Two typical combining 
schemes are maximum ratio combining (MRC), which obtains 
the highest signal gain by setting 

[g“ RC ... g^ c ] = TT>. (5) 


and zero-forcing combining (ZFC) where the pseudo-inverse 
of Hj is used to suppress intra-cell interference: 

[g,T C - • - g^ f 'j - (TTyiTJ ; TT,. (6) 

The following theorem provides an average sum SE in the 
cells with MRC and ZFC, and generalizes previous results in 
[3j for equal number of UEs per cell. 
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Theorem 1. For a given coalition structure C, a lower bound 
on the average ergodic sum capacity in cell j is 

Vi(C ] = (l - I) f l*j(C)l log, (l + 7 ,J m(c) ) , (9) 

where the interference term /| cheme is given in <|7]) for MRC 
and in ([8]) for ZFC (both at the top of the page). The following 
propagation parameters appear in these expressions: 

/ ^= 1 - 2 ' ,10) 

Proof: The proof is along the lines of the proofs of 

Lemma 2 and Th. 1 in 0 and are omitted for breviety. ■ 

The factor (1 — ^) in ([9]) is the pilot signaling overhead, 
while second factor is the number of active UEs in cell 
j: Kj = -^|<E>j(C)|. The interference terms and 

lJ jFG (C) have intuitive interpretations. The first part in both 
expressions is the pilot contamination and is only impacted by 
the cells that have formed a coalition with BS j. The second 
part is the conventional inter-user interference (from all cells). 
MRC suppresses this part by the full array gain of M, while 
ZFC cancels out part of the interference and suppresses the 
remaining interference by a reduced array gain of M — Kj. 

The parameter p';^ in ( [TO} is the average ratio between the 
channel variance to BS j and the channel variance to BS l, 
for a UE in cell l. Along with its second-order moments, Pj, , 
these parameters characterize the network topology. Notice 
that = p^fj = 1 , while the values get smaller as further 
cell j and cell l are apart. In general, if) 1 pj^ for j 7 / l. 

The average sum SE Uj(C) in (|9j, the utility function of 
BS j in the remainder of this paper, should preferably be as 
large as possible^ There are thus L utilities which depend 
on the combining scheme (e.g., MRC or ZFC) and on the 
coalition structure C. Since the number of possible coalition 
structures equals the /.th Bell number, which has a faster 
growth than exponential with L, finding a globally optimal 
pilot assignment is hard. Therefore, we formulate next the 
design problem as a coalitional game to provide a distributed 
and efficient algorithm. 

'Note that maximizing the sum SE might lead to operating points with 
many active users and low SE per user, but this is still beneficial for all users 
as compared to time-sharing where each user is only active part of time but 
exhibit a higher SE when being active. 


III. Coalitional Game 


Cooperation between the BSs can be analyzed using coali¬ 
tional games ©• Since the average SE of each cell, given in 
Theorem |T] depends on the coalition structure (Definition [TJ, 
we need to study the coalitional game in partition form 19], 
which we formulate by the tuple (£, (qj)j & c, iUj)jec)- Here, 
the set of players corresponds to the set of BSs C. Each player 
in C is endowed with searching budget qu £ N which limits 
the number of searches he can perform to find a coalition to 
join. The utility of a player j is formulated to be 


Uj(C, rjj) 


Uj{C) if ip < qj 

0 otherwise ’ 


( 11 ) 


which is specified by its SE Uj whenever a player j has not 
exhausted its searching budget qj where rjj £ N represents the 
number of searches player j has already performed. 

From Theorem |T] the utility of cell j depends on which 
members are in its coalition 'I’j (C) through the pilot contami¬ 
nation term as well as the interference term determined by the 
structure of the coalitions forming outside ( l> t (C). Therefore, 
so-called externalities exist. Specifically, our game belongs to 
the category of negative externalities, since the merging of 
coalitions reduces the utility of all coalitions not involved in 
the merging due to the increased number of scheduled users 
and thereby the increased interference. 

We adopt the game theoretic assumptions which implies that 
each player’s behavior follows the maximization of his utility 
based on the discovery of profitable opportunities [ 141. Such 
behavior is important for distributed coalition formation which 
we specify and discuss next. 


A. Coalition Formation 

Coalition formation represents the dynamics which lead to 
stable coalition structures. We use a coalition formation model 
from m in which a single player is allowed to leave its 
coalition and join another. 

Definition 2 (Deviation). A cell j £ C leaves its current 
coalition &j(C) to join coalition S £ CU{0}. In doing so, the 
coalition structure C changes to C s . We capture this change 
in the coalition structure by the notation C s £^— C. 

According to individual stability |ilO), a deviation is admis¬ 
sible if a player strictly improves his performance by leaving 
its coalition to join another coalition. In addition, the members 
of the coalition which he joins should not reduce their utility. 




















Definition 3 (Admissible deviation). A deviation C s C 
is admissible ifUj(C s ,r]j) > Uj(C,r]j), and Uk(C s ,t]k ) > 
Uk(C,t]k), for all k € S. 

The requirement imposed through the admissible deviation 
is suitable in our setting due to the fact that each cell 
exclusively owns a set of pilots. Thus, any BS which wants 
to join a coalition by sharing its pilots with its members must 
first ask their permission. 

Based on the player’s deviation model, we utilize the 
following stability concept defined in fTO) . 

Definition 4 (Individual stability). A coalition structure C is 
individually stable if there exists no j £ C and coalition S 
such that deviation C s C is admissible. 

The coalition formation algorithm which leads to individ¬ 
ually stable coalition structures is described in Algorithm [T] 
The coalition structure is initialized with singleton coalitions, 
corresponding to the noncooperative state in which no pilots 
are shared between the cells. Note that the algorithm can be 
initialized with any coalition structure. A BS j is selected at 
random to check if a deviation is profitable. Based on the 
local knowledge of the current coalition structure C t and the 
propagation parameters, BS j can calculate its utility in ( fTTj ) 
if it joins other coalitions. Then, BS j selects a coalition 
at random in which it would profit by joining (Line 3) and 
asks its members for permission to join (Line 4). Here, we 
assume that the BSs are able to communicate and exchange 
such application-type messages. Each BS in a coalition can 
calculate its utility locally for the case the asking BS enters 
the coalition. In Line 5, the number of searches by BS j 
is incremented. If the deviation is admissible (Definition [3}, 
then it joins the coalition (Line 7) and the coalition structure 
changes accordingly (Line 8 ). Coalition formation stops when 
no deviations take place anymore. 

Theorem 2. Algorithm [7] converges to an individually stable 
coalition structure with an upper bound on the number of 
deviations as t < Ylj^c 7 ,r 

Proof: The convergence of Algorithm's guaranteed due 
to the searching budget incorporated in the utility function 
in ( fTT) leading to t < Qj- The stability result follows 

from iterating over all deviation opportunities until individual 
stability (Definition [4| is satisfied. ■ 

IV. Simulations 

In the simulations, we consider S = 400 transmission 
symbols (e.g., T c = 4 ms and W c = 100 kHz), SNR = ^ 
= 5 dB, and a pathloss exponent of 3. Each BS owns 10 pilot 
sequences that are orthogonal to the pilots of all other BSs. 
Accordingly, we set the total number of pilots as B = 10 L 
with L being the number of BSs. For coalition formation, the 
searching budget of BS j is set to qj = 100 for all j £ C. We 
obtain the average performance from 10 3 uniformly random 
BS deployments with uniform user distributions in each cell 
and a wrap-around topology, as exemplified in Fig. [I] 


Algorithm 1 Coalition formation algorithm. 

Initilize: t = 0, Co = {{1}, • ■ •, {iT}}, Vj = 0, j £ C 

l: repeat 

2 : for all BSs j £ C do 

3: for all S £ C t s.t. Uj(Cf,rij) > Uj(C t ,rij) do 

4: Ask members of S for permission to join; 

5: Increment searching factor r/j = r/ 7 4-1; 

6 : if deviation Cf Ct is admissible then 

7: Leave and join S\ 

8 : Update C t +1 = Cf ; 

9: t = t -f- 1; 

10 : Go to Line 2; 

11 : until No cell deviates 


In Fig. [2j the average SE with MRC is plotted for different 
number of antennas at the BSs. Two scenarios are selected 
for which we ensure the same BS density of 25 BSs per 
km 2 by appropriately choosing the region area the cells are 
deployed in. For L = 7, finding the optimal coalition structure 
by exhaustive search is computationally possible. The other 
scenario is for L = 20 BSs. The associated performance using 
the ZFC scheme in the same scenarios is provided in Fig. [3] 

From Fig. [2] the performance of coalition formation and 
the grand coalition (i.e., all BSs use all pilot resources) can 
be observed to be close to optimal in the case of 7 cells. The 
grand coalition outperforms the proposed coalition formation 
in the case of 7 cells when M > 600. Though the practical 
range of M is less than 500, the intersection point of the grand 
coalition and coalition formation curves generally depends on 
the choice of B. For fixed B, the high performance of the 
grand coalition with very large M can be explained through 
the vanishing interference terms in the utility functions in 
Theorem [I] Consequently, it is efficient to schedule as many 
UEs as possible in all cells, achieved by the grand coalition, 
in order to maximize the pre-log term in the average SE 
expression when M is very large. In the ZFC scheme in 
Fig. [3] for L = 7, the grand coalition is more efficient than 
the proposed coalition formation when M > 800 for similar 
reasons. Here, it is observed that noncooperation outperforms 
the grand coalition with ZFC when the available degrees of 
freedom are small. In the case of 20 cells in which B = 200, 
coalition formation provides performance gains compared to 
the other schemes in the selected range of M. 

The corresponding average coalition sizes for the plots in 
Fig.[2]and Fig.[3]are shown in Fig. [4] Similar average coalition 
sizes can be observed for the same number of BSs for MRC 
and ZFC. Evidently, the grand coalition is rarely reached with 
coalition formation. The complexity of coalition formation 
is reflected by the average number of searches by the BSs 
illustrated in Fig. [5] This number is incremented in Line 5 in 
Algorithm |T] Although, we set the searching budget to 100 in 
the simulations, the average number of searches is very low 
and about a quarter of the number of BSs. This measure does 
not show significant dependence on the number of antennas. 
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Fig. 2. Average spectral efficiency with MRC at the BSs. 


Fig. 3. Average spectral efficiency with ZFC at the BSs. 


V. Conclusion 

A distributed algorithm was proposed for pilot allocation 
in the uplink of cellular massive MIMO networks of arbitrary 
shape. By assuming that each cell has a few unique pilots and 
can form coalitions with other cells to share pilot resources, 
a coalitional game in partition form was formulated. Each 
BS wants to maximize the average SE in its cell, taking 
pilots, CSI quality, and interference into account. The proposed 
mechanism has low complexity and provides performance 
gains compared to one-cell coalitions and full pilot reuse 
schemes. The solution is applicable also in the downlink, by 
capitalizing on uplink-downlink duality |3j. Future work will 
further analyze the performance dependencies on the total pilot 
budget as well as on the number of cells and their geometries. 
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